Search results for "Document retrieval"
showing 10 items of 10 documents
Discovering temporal relationships in databases of newspapers
1998
This paper is mainly dedicated to analysing the problem of discovering frequent temporal patterns in event sequences extracted from a large repository of newspapers. The proposed formalism and algorithms rely on Toodor, which is a document retrieval system that allows users to specify conditions over the structure, contents and temporal features of the stored documents. We develop in this work several algorithms for recognising frequent temporal patterns in terms of arc-consistency, which consists of discarding temporal occurrences that do not satisfy a temporal structure.
ExtMiner: Combining multiple ranking and clustering algorithms for structured document retrieval
2006
This paper introduces ExtMiner, a platform and potential tool for information management in SMEs (small and medium-sized enterprises), or for organizational workgroups. ExtMiner supports interactive and iterative clustering of documents. It provides users with visual cluster and list views at the same time, supporting an iterative search process. ExtMiner may also be applied as a platform for research on retrieval fusion, since it combines search, clustering and visualization algorithms. ExtMiner was evaluated with three document collections. Although the findings were encouraging, the user interface and performance with large document repositories need further development.
Some Results Using Different Approaches to Merge Visual and Text-Based Features in CLEF’08 Photo Collection
2009
This paper describes the participation of the MIRACLE team at the ImageCLEF Photographic Retrieval task of CLEF 2008. We succeeded in submitting 41 runs. As in previous experiments in the MIRACLE team campaigns [5, 6] using different software, the results obtained from text-based retrieval are better than those from content-based retrieval. Our main aim was to experiment with several merging approaches to fuse text-based and content-based retrieval results; we improved on the text-based baseline when applying one of the three merging algorithms, although visual results are lower than textual ones.
Metadata-Oriented Language Model in Translingual Retrieval of Digital Data
2015
Translingual retrieval relies on processing a source language to retrieve digital document content in a target language. From the perspective of successfully browsing digital catalogues, the probability of retrieving a full-text document in a language other than the query language is close to zero, owing to the fact that the problem lies not only in the library collection but especially in matching the index terms with the query keywords, which are assumed to be their translation equivalents. In addition, hardly any digital library system incorporates a translation component. As a result, such a matching is rather coincidental. Our approach to the translingual document retrieval problem is …
A Concurrent Neural Classifier for HTML Documents Retrieval
2003
A neural-based multi-agent system for automatic HTML page retrieval is presented. The system is based on the EαNet architecture, a neural network having good generalization capabilities and able to learn the activation function of its hidden units. The starting hypothesis is that the HTML pages are stored in networked repositories. The system goal is to retrieve documents satisfying a user query and belonging to a given class (e.g. documents containing the word “football” and talking about “Sports”). The system is composed of three interacting agents: the EαNet Neural Classifier Mobile Agent, the Query Agent, and the Locator Agent. The whole system was successfully implemented exploiting t…
Multimedia Retrieval in a Medical Image Collection: Results Using Modality Classes
2013
Effective communication between users and systems is one of the main aims of the Multimedia Information Retrieval field. In this paper the modality classification of images is used to expand the user queries within the ImageCLEF Medical Retrieval collection provided by the organizers. Our main contribution is to show how and when results can be improved by understanding modality-related challenges. To do so, a detailed analysis of the results of the experiments carried out is presented, and the comparison between these results shows that the improvement obtained with modality class query expansion is query-dependent.
A neural multi-agent based system for smart html pages retrieval
2003
A neural-based multi-agent system for smart HTML page retrieval is presented. The system is based on the EalphaNet architecture, a neural network capable of learning the activation function of its hidden units and having good generalization capabilities. The system goal is to retrieve documents satisfying a query and dealing with a specific topic. The system has been developed using the basic features supplied by the Jade platform for agent creation, coordination and control. The system is composed of four agents: the trainer agent, the neural classifier mobile agent, the interface agent, and the librarian agent. The sub-symbolic knowledge of the neural classifier mobile agent is automatically …
Visual Re-Ranking for Multi-Aspect Information Retrieval
2017
We present visual re-ranking, an interactive visualization technique for multi-aspect information retrieval. In multi-aspect search, the information need of the user consists of more than one aspect or query simultaneously. While visualization and interactive search user interface techniques for improving user interpretation of search results have been proposed, current research lacks an understanding of how useful these are for the user: whether they lead to quantifiable benefits in perceiving the result space and allow faster and more precise retrieval. Our technique visualizes relevance and document density on a two-dimensional map with respect to the query phrases. Pointing to a locat…
Context-aware summary generation for Web pages
2009
General-purpose search engines provide users with lists of retrieved documents in response to their queries. The common structure of list elements includes the title of a document, its URL, and a small snippet from the text. Snippets are evidence of occurrences of the query's keywords in the document. The length of each snippet is just a couple of lines. They cannot play the role of summaries of retrieved documents: in many cases, they are not indicative and users cannot judge the relevance of documents. In our approach we use an ontology as a context description; that ontology is used to describe the user's main interest with respect to the wanted summary and to help select the weighting of key words an…
Flexible entity search on surfaces
2016
Surface computing allows flexible search interaction where users can manipulate the representation of entities recommended for them to create new queries or augment existing queries by taking advantage of increased screen real estate and almost physical tactile interaction. We demonstrate a search system based on 1) Direct Manipulation of Entity Representation on Surfaces and 2) Entity Recommendation and Document Retrieval. Entities are modeled as a knowledge graph and the relevance of each entity is computed using the graph structure. Users can manipulate the representation of entities via spatial grouping and by assigning preferences to entities. Our contribution can help to design effective infor…